Image Segmentation and Scene Understanding Project

نویسندگان

  • Xiaohe Shi
  • Stephen Gould
چکیده

1. Introduction Scene or image understanding deals with the problem of making a computer " understand " the world behind the image. This can be done in a number of different ways. In this project, we will deal with a kind of problem of scene understanding, semantic image segmentation or pixel labeling. Multi-class image segmentation or pixel labeling does more than the task of object recognition, which just recognizes a particular kind of object. It tries to do object recognition and classify all pixels in an image to particular object classes concurrently. That is, we take an image in and then automatically breaks the image into regions and labels each region with a semantic class label. For example, when the computer reads an image of a person riding a horse on the grass, it can segment the image into several regions with semantic class labels such as person, horse, grass or even sky, etc. (In our project, this is implemented by semantic pixel labeling.) In this semester, my work on the project was divided into 2 phases. In the first phase, I developed a web application on scene understanding where users can easily submit images to the website server and do the state-of-the-art scene understanding work on their submitted images. The server processes the images using a current state-of-the-art scene understanding model and return image segmentation results to the user. Importantly, the web application will maintain a record of all previously processed images so that it can avoid recalculating results on these images (and simply return the results to the user). Also, it can be easily extended to include more and more scene understanding models to make it much more convenient to compare the segmentation results of different models. In the second phase, I extended the current state of the art scene understanding models by incorporating relative location information. This is built on recent work on scene understanding (e.g., [2] and [1]). [1] describes a method of incorporating relative location information of different classes as local features on the superpixels level, and I applied this method on the pixels level. That is, to overcome the challenge of computation on such global information, [1] proposed an approximate two-stage processing (learning and prediction). In the first stage, we still use the current framework and current local features to train the model to get predictions on the pixels' labels. Then, we use the …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modified CLPSO-based fuzzy classification System: Color Image Segmentation

Fuzzy segmentation is an effective way of segmenting out objects in images containing both random noise and varying illumination. In this paper, a modified method based on the Comprehensive Learning Particle Swarm Optimization (CLPSO) is proposed for pixel classification in HSI color space by selecting a fuzzy classification system with minimum number of fuzzy rules and minimum number of incorr...

متن کامل

Object-Oriented Method for Automatic Extraction of Road from High Resolution Satellite Images

As the information carried in a high spatial resolution image is not represented by single pixels but by meaningful image objects, which include the association of multiple pixels and their mutual relations, the object based method has become one of the most commonly used strategies for the processing of high resolution imagery. This processing comprises two fundamental and critical steps towar...

متن کامل

Cooperative Distributed Image Segmentation

The purpose of Image Understanding is to construct a structural description of the scene from an observed image(s). The scene description is generated by Feature Extraction: extract image features from the image, Image Description: construct the structural description of the image in terms of the extracted features, Object Recognition: detect and identify objects in the scene using the image de...

متن کامل

Image Holistic Scene Understanding Based on Global Contextual Features and Bayesian Topic Model

Image holistic scene understanding based on global contextual features and Bayesian topic model is proposed. The model integrates three basic subtasks: the scene classification, image annotation and semantic segmentation. The model takes full advantage of global feature information in two aspects. On the one side, the performance of image scene classification and image annotation are boosted by...

متن کامل

Image Holistic Scene Understanding Based on Image Intrinsic Characteristics and Conditional Random Fields

Image holistic scene understanding based on image intrinsic characteristics and conditional random fields is proposed. The model integrates image scene classification, image semantic segmentation and object detection. 1) For the scene classification, we use method of PHOW feature extraction plus KPCA dimensional reduction to obtain feature information for each image. 2) For object detection sec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013